SEQIDN0:16 

Rat Smooth Muscle Myosin Heavy Chain Gene Sequence (-4,216 to +1 1,795) 



Nucleotide 1 corresponds to -4,216 bp relative to the SM-MHC transcription start site 

AGATCTTAAA ACACATCAAC CTGGGCTGAG GGGATGTGTG TCTCTGTGTC TGTGTATGCA 60 

CATGCATTTG AGGCCAGATG AAAATGTCAG ATGTCCTCTC ACTGCTTTAT TCCCTTGAGA 12 0 

CAGGGTCCCT CACTGAACTT GTTGGAGCTA TGCTGGTAGC CAGCAAGCCC CAGTGGCCTT 180 

CCTGTCTCTA TCTCACACAG CACAATATGT GTGGCCATGC TCCACTTTTT TACATGGAAA 240 

TTGGGGTCTT CCAACTGGGG TTCTCATTTG TGCAGTGACA CTCTTCCCCA CTGAGCCATC 3 00 

TCCTCAGGCC AGC TGATATA TTTTTAAATA ATTAAATATT TAGCACATGC CTTTAGAAGC 3 60 

CAATAGCTAT TTAAAGCTGT TTGCTTAAAA AAAAAAAAAA AAAAAAGACT TCATTATCCC 42 0 

AACACTTATG AGGGAGAGAC AATAATTCCA AAACCAGAAC CAGCCAGGGT ACACAGTGAG 4 80 

ACTTTATTTA AAAAAAAAAA AAAAAGAAAG AAAGAAAAAA AAAAGAAAAA GAAAAAAAAA 540 

GGCTCCAAAG AGAAATTTCC CCTTCATCAT CTAATCACAA GAAAACAATT TATTTATTTT 60 0 

GACATCACTC AGTCCAAAGG AGCTTTTTGT AAAGTGACTT CTCTTCTTAA AATAAGTGAC 66 0 

CCTTCCCAAC CACCAAAAAC AAAACAGAAA CCTCTGCCCT GTTCTAGAGT CCTTTTGAAG 72 0 

ACTTCAGATA CCTGAAGAGT GGACAGATAT TTACCGAGTG ACTTAAATGA ACATACTGTC 780 

CCTGGGTACT GCTCAAGCAT GCCAGGAGAG CATGGATGGT TTATGCAAGG CTGGCACTGT 840 

CATTAACAAC TCAGTAAGGC GGAGAAGACA GAGAGCCTCT CCTAAGACAA TGGCACATAA 90 0 

GGACATGGGT AACCCCAGAG GTTCCCGGCT AGTACTTAGC AGAGCTGAGA TCAGACTTGG 96 0 

GCCTCTGTGC TCGCTTGCCT AGTGGGCAAC ACTCAAGACT GGGGTAAACA ATAAGTTGAT 102 0 

CTGGGATATG GCTCAGTAAT CACACTGAGA ATTCAACACT GGGAAGGCAG AGGAGGATCC 10 8 0 

CTGGGATTGC TGCCTGGCTC TCTAGCAGCC TAGCAGAATC AACAAACTCC AGGTTCAGTG 114 0 

AGAGATGCTC ACAAAATAAA ATGGAGGAGC AACTGAACAC ACTCAGTGTT GACCCACACA 12 00 

CACACTAAAG AACACGTGTA CCACACAGAC ACAGACACAG GATAACCTAC CCATGTTGTG 12 60 

TATGGACTCA GCCAGCCCAG GTTGGAAACT CAGTTCCTCT GTTAACTCTT TTCAAACCTG 132 0 

GGTCCTCAGC GATGTGCTGG GGAACCTACT TCACGGCATT ATTCTGGGCA TTAGATGTAA 13 8 0 

AGGAAGCAGT AAAGTTTCCC TTTTCTTGAC TGAGGTGATG CGAGAATGAG GGCCTGAATT 144 0 

CCATCTCTAG GACTCACATA AAGACACCCA GACTGCACTG GCCAGTAAGC CTCACCTATG 150 0 

CCTCCAAGCC TGGCTGTGAG AGACTGTCTC AAAAACAAAG TAAAAACAAC AAAATCAATG 1560 

TCAGATGTGC ACACATCGAA TCCCAGCATG TGTAC GGCAT GCTTGCAGTC AGCCTTGTTT 162 0 

ACAGAGAGTT CTAGGCCAAC CAGCTATACA CAGTGAGACC CTGTGGTAGA CGGCTCCTAA 1680 

GAACTGACAT TTGTGACTGA CAGATGTGCA CATCTACCAC ATGCACATCA CAGTTTCCAT 1740 

TTTACAAAAA GGTTAACACT TACTAATTGA TTAGGGAGTG GGGCACCCCA CTGCTACATG 1800 

TGAAAGCCAG AGAATGATGT GTTCCAGTCG GTCAGTTGTG TCCTTCCACC ATGTAGGTCC I860 

TAAAAATGGA ACTCAAGGCA GTCTTGGCAG CAAGTGCTTT ATC CATAGTG CCATCTTATT 192 0 

GGCCCAGTCT CCTTATAATG AAATTATTTG TGTTTCCAAG TTGATGTAAT TCTTTAAAAA 1980 

TCAGCTGTGC TCCTTGGAGT TTGACTTCAC TGAAGCCTGC TACAGGAGTG CCCTTCCTTC 2040 

CTAGCACTAG GATGGCCAGC TCTGGGCTGG TTTCAGACTA GGGTAGGTGC AGGTGGGCCC 2100 

TGGGCTTCCC TCCTTCATTC CTCCTGGGCT CAATGCCAAG CCGGTTTCCA TTCCTTTTAC 2160 

GTGCACTGCG AAGAGGCTTT GGGGAAGCGG CCTCATCCAT CATGCAGAGA GCTCCTCCCC 222 0 

CACCTCTACA GAGAGC CAGC CAAGCTGCTG TCCTTGGCTC TGCTCTGTCC ACCCTGTGAG 22 80 

GAGGCTGGGA TGAGGTTGGG GATGGGGAGG ATCAGGATTC AGATGTTTTC AAGTCTGAGA 2340 

AGCAGGTGAG CTTGGTCCTA GAAGAATATG GAAGGGGTCT ACTGGGGTTG AGATATAGAT 24 00 

CACTGTATCA AAGTCAACAG GGGGGCTGTG TGGCTTTTTC ATATCCCAAA GTCAGCTTGG 24 60 

TGCTGGTTTC CTAGGCTTCC TGAGTCCGAC AAAGGTGCAG TGTGTTAATC TCACACCACT 2520 

TCAAGGACTG TTACAAAAAA AAAATAGGAA GGAGCTCGAT TCGCCCCTTT TTACAGGCAG 2580 

GGTAACTAAG AGCCAGTACT TGCCCATGGT CCTGCTGTTA TAAAGAGGCT CAGTAGACTC 264 0 

CCATTCAAAC AACTGTGCTC AGAGGCCTTC TGTCGTC CTG TGGCCAATTC CCCTATTGCT 270 0 

CTCTGGAGTG AATATTGGGA TATTAAACAG TACTGAC CTT GCTGAGGACC CTCAGGGTAC 2760 

TCAGCTCTTC TGGCCTGCAA AATGGGGCTG GGACAGGTTG GCCAGGATCA TCCTCTGGTT 282 0 

GGGAGAACCA GCTGCACGTG GGTCTGGAGC TCTTATTAGT ACTGGGGTCC CCATAACGCT 2 880 

CCATGGGCTC AGCGGGAGGC TGCACGGGAC CATATTTAGT CAGGGGGAGC CAGAGCCCCG 2 94 0 

CTGGTATGCC AAGCTGGGAA TTCTTGTTTC GAGAATTGCG CCTGGCCTTT TTGGGTTGTT 300 0 

TCCCGCCCAG GCCCAGGAGG GAGGACCAGC TCAGGACCTC GAGGGTCCGT GCGCGGGGAG 3 060 

CGAGGCGTCC CCGGCCTGGC ATGAGGCCAA CTCTGCCTCG ACTTCCTTTT ATGGCCTGAG 312 0 
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TGTGAGTGCA TGGAGAGTGG GAGGGAGGGA GGGAGAGAGG GAGGAAAGAA AGCGGGGTGG 3180 

GGGGGTGGGG GGGTGGGGGG GTGGGGGGGT GCGGAGAGCA GAGACAGAGA CAGAGAGACA 3 240 

GAGAGACACA CAGAGAGAGA CAGAGAGACA GAGAGACACA CAGAGAGAGA CAGAGACAGA 3300 

CACACACAGA GAGAGACAGA CAGACAAAGA GAGAGACAGA GACAGAGAGA CACACACAGA 3360 

5 GAGACAGACA GACAAAAAGA GAAGAGAGAC AGAGACTTTA GGGACGTAAT CATCACAGGG 3420 

AAATCAAAGC TAAGAGTGTG ATGAAAAGAG TGTCAGGTCA GACAAAAGAG AC AGGGGC CA 34 80 

AGATCCGTAC AGGGCTAAGG GACACAGAGA TTGAGAACAC CGAGTGGTAA GGGGGGCAGC 3 540 

TGACAGCAGG TCCCCCACAT TCTCTTAGAG TCTTAGCATG CATCCTCCAA GTGC CATAAC 3 600 

GCAGTAGCAA CCCGCTTTTC AACGATGCTC AGAGAAACCA TGTTATTGGT CCCAGGCACC 3660 

10 CCGGTTGTAG GGTGAAAGGA GCTGCAGAGA ACAAGT TGGA AAAACAAGTT TCCCAGCAGT 3720 

CACAGAGGAT ATGCAGTGAC TGTGCCGACT TGTTTTTTTT TTTTTAAGTC CCCTTCCCCC 37 80 

CCCCCGCCCC GCCCCCGGCT TGCTAAGCAC AACCGGCTTC GAATCTTAGG AAGTGGCAGG 3840 

CGAATGAAGA GGGGATGAGG GAGAGAGGGT GGCATCAAGT CTCCAGTATG TATGAACAGA 3900 

AAGAGGTTAA AATCCAGCTG GAATGGACCT AGGGGAAGAA ATTCTCAAGT CTCCCTACAG 3960 

15 ACTCTGAACA CCGAATCCCT TTTCTCTAAG GACGCAGGAT CTGGGTGGCT GCAGGGAGCG 4020 

AGGCCTGAGG CTGTGGGTCA ACTTGCCAGC AGCCCCCCTG CGCCTGCGCT AGGTGGTTCC 40 8 0 

CAGAGGCTCT GTTCCTCACC TGCAGGGGGC GCTGGGAAGG GCAGAGGACC CTCCCACCCC 414 0 

GCCCGGCAGT CACCTCCCCT TCCCCACCCT CGGGTAG CGC TGACTCTATA AAGCCAGATG 42 00 

I — ► transcription start site +1 

2€U TCCGAAGCAT ACAGAGAGAT TTGGACCATC CCAGCCTGGG ATCAGTGTCA GATCCGAGCT 4260 

« CTCCATCCGG TGTTCTCCTG CTAGTCCACC CCAGTAGCAG ATCTGTAAGT AGAAGTTGAT 432 0 

S CCCTTAGGGG CAAGCCTGGG CGGTGAGCTT GAGCAGCTTC TAAAACATCC TCCAGGGAGT 43 8 0 

GGGGACCCCA AGGGGTTCTG ATTGTCATCT CTTATAAGGA CAGTGGGAAG AAGCCCGGTA 444 0 

^ CAGGACCACC CTAGACCTCC CGTGATTACT CCCATTCTCC GCACCAAACC AGCATCCTCA 450 0 

25^ GGTTGCCTAT GAACAGAACC AC CTGGGAAA GTGGGGTAGG TAATTAAAGG TTCTGGCCAC 4560 

M TGGGCCCAAT TCCAGGTATT TTAAGACTAC AGTCTAAAAA GCAAACAAAA TGGCCTACTT 462 0 

fU AAAAACTAAC TAGTGACACA GTGGACAAGT GAACTGTGGT GGAAACTGTG GGTCTGAATT 4 680 

01 C AAATAC CAG TATTGAAAAT AATAAGAAGT CTGGGATAAA TATCCACTGA ACATCCCCAG 4740 

, AATACTCAAA ACATGGGTTA AAGTTTAATG ACTCTGAACA CAGGCCGTGT GTTCTTATTC 4 800 

30t CACTCCTAAT GGAATGTGCT GTTGAAAATT TACTGGTAAA CAAAAATGCT TAATGTTAAA 4 860 

f TAAGGTCGTT TCTTCCTCTG TTACTTCCAA AACACAAATC TCCATTAAAA AGGAACCTTC 4 920 

P TCCAGTTTGG TTGGGCCCCC AGATGCCCAG GTGGGTGCTG AGGCTCCATT TGCATCCCCC 4 980 

S :| ACACTGAGTG AGCAGACGAT GGATTTTGGG GCTCCTCAGT GGGAAGGTTA CTCTCAGGTC 5 040 

AGGGAGAGGA GCTAGCAGAG AAATTTATGC TATTCCAGTT CAGAATTGGA GAAGTCTTGC 5100 

3p C ATGTC CAGA AAGCACCCTT CAAAGTTATG TCTGTCAGAG AACAGAAAAA TTTTTTTTGA 5160 

Ell AAGCCAGGAC AAGGCTGCTT TGGTTCTACT ACTAAGAACT GAAAAACTGC TGACTTGCTG 52 20 

GGAAAGAAGG AAATCCGGTT GTGTTTGGTA AACTACTCTG CTTCGTTGGT TTCCTGGGGG 52 80 

AGGTTTTTTT TTAGTTCAGT AATTCAATAT GCTATTTTAG ACTCAAAGAA AGACAGGTCT 534 0 

GAAAGTCTCT CATAACAAGA AACACTTTCT CTTTTATGAT GTTGTTGATG GC AC ACT TAA 54 00 

40 CAAGCCAGGT GCTTTAACAG CGTTTAGATG GAACTGGGTT CTTTTAATCA TCATATACAC 54 60 

CTTACCTTGT CTTGACATCT CTGTTTTTCC CAAAACCAAA ATTTGTTGGA CTCCTGTTTC 5520 

TGATGGATTC AGTGTTTCCA GCTTCCATCA CTTTTTGAAG AAGATTGAAA CTGATCTTTT 5580 

ACCAATTTAA AATGACAGAG ACTGTCTTTT AAATTTTGTT GATGTTGTTG TTTCC CTGTG 564 0 

GATGTGGTAG GGTTCCAGGA GGCTGGCGTG ATCTCAAACA TGCCTGGGCC AAGCCACCCT 57 00 

45 GGAGAAACCT GGACTTTTAT TATCAGATCT GAAATAGAGC CTCTTCCGTA CAAGGTAGTC 57 60 

ACTATGGATT TATCATTACT TTTCTGTGGG AGGCTGGGCT GGAGGCAGAC ATGCCCTTGT 582 0 

ATGGTAGTGT TTTCTATGAG GCCATTCCCA GTCCCCCTTG GCCAATCACC CAGCCTTTCG 588 0 

ATGCAGCCTG ACTGGCTTGA GTTCTGGGTA CTTCTCTGTC TTTCCCTGTA GAGATGGACA 594 0 

ATGAAGTTCT TTTTTTCCTC TCTTTTCTTG TTTGGAAGTT CTATTTGTAT TTTTTTGGTG 6000 

50 GAAATTATAT TCCACATATC TAATAAGAAC GGGTGGTGTT TACATCTAAT AAACCATTGA 6060 

ATAATTTTGA AACAGGATAA AGACGATCCT TTTAGAAAAC TATATCCCGT TTCAAATACT 612 0 

CAGAATCAGG TCTTAACCAC ATTATTTTGC CAGGTATGGT GGCTTGTGTC TAAAATACTA 6180 

GCACTTGGGA GGCTAAAGCA AGAGAGTTTG AGGCTAACCT GGACTGCATA GCAAGTTCAG 624 0 

GCCATCCTGG AC TAC AGTGG GAAACACTAT CTTGGAAAAA ATAAAAAATA AAAATCAAAA 63 00 

55 CCCAGCCTAA TGGTACATAA CTTCAATTCC AGCATCTGAG GTAAACCAGG AAGCACAGCT 6360 

GATTAATGAA CCCAAAGTCA GCCTGGGCTA CCTAAGGAAT CCTATCTTTT ACAATTTGTT 642 0 

GATGCTGTTG TCATTTTCCT GATCACTTTC CCATCTGCAG AATGGGACTG TTGAGAACAG 64 80 

CCAGCGTGTT AATGTTTCTG TAGCACTTGC TTAGTCTTCT GAGAAGTAGA AGATCACTTA 6540 

GCTAGGGTTT GATCCCCATG ACTGCAGCAA AAGAGGAAGA CTCATTAATT GGAGTCTTCA 6600 
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CAGTAGCCCT TGGAACCAAT ACTAATAGTC TTCACTCCAT TTCATAAATG TGGGCTTTGA 6660 

AAACTTTGTT CTGTCTATAA AAGATGGGGG CTCTTACAAA CTAAGCTTCT TGTAACTCCA 6720 

GAGCCTAATG CCCTTTTGGG AGCTTTCAAT AGATAACCCA TGTGAAGGGT CTGACACAAG 67 8 0 

GCTGGCACCA GCAAAGTTCA GCAGATGGTA ATTTATAGTA ATATGACTAG GGACGCTTAA 684 0 

5 GAGCATATTC TGTATGACAC AGCTGATATC AAGAAACCCA AACGGTGGCC TTTCCCCTAA 6900 

AGCAGAAACT CACCCCTAAT TTTCCTTTAG TGTAAATCTC ATAGTGGATT CTTTGCTCCC 6960 

TGGTTCTCTT TCTGTCACTA GTGACCTTTT AGTTACATTG ATCTATAGGC TTCAAGGACC 702 0 

AGGAGGCACA GAGTCAAGAG AAAGGCAAGC AAGAATTTGA AGGGAGAAGG AAACCGCTCA 70 80 

GCACTGTAGC AAGGGGAGGT CAGGCTACCA TGATGCTCCT GCGCTTCAGG GAATTATCCT 714 0 

10 CTCAGAATGG CCAACAGGGT AGGGACCTGG CCTGTTCCAC TCAGGCCCAT TTGAACTTTC 72 0 0 

TTTCTGTTCT ATGGGTCCCT ACAGATGAAT TCAGCCCACT GTAGACTGGA AGTTCATCTT 7260 

TAACAGCATC CAAACGGAAC ACATACAGAC CTTCTTTCTT GTCACTGTCC CTGAGTCAAG 732 0 

CAGCATAAGA ACTATGTCTG CCAACCTGCG AGGGGAAGTT GCTCAAGATG CTATGCAAAC 73 8 0 

ACTCCAGCTT TCCATGGAAG GGACTTCAGC ATCTATGGAT GGTGGTAGCA AAGCACTCCT 744 0 

15 CAAGCTGATC AAAGAATAGC TGTCCCTTCC TGCCCCTCCC CTAATGAAGC GTGCAGTCAG 7500 

TGACAGAGAC CTCAGAAATG TCTTAGGTCA CCAAAGGTCA TTCTTGCCAT CCCAGGCTCC 7560 

AGATTAGCAT TTTCTCCCTT TTTATTTCCC TCCATTTTGC CTGTCTGCAT ATGCACTACT 762 0 

AACAAACATT CTTTCTTTCT TTTTTTTTTT TTTCTTGGAG CTGGGGACTG AACCCAGGGC 768 0 

CTTGCGCTTG CTAGGCAAGC GCTCTACCAC TGAGCTAAAT CCCCAGCCCC GCTAACAAAC 774 0 

20.. ATTCTTAAAT AGAATTCTAA ATTTTTTAAA GTCAAATTTC CCTTTTACTC AAACCCTGGC 78 0 0 

r*.' ATTTTACAAA ACATTTTTCA CCTTATCACA AATCTTCACT ATCTTTTCTA TATCTTTATA 7 860 

P TCATTGTATG TTACTTTTTA TCTGCTACGT AGTATTCTGT TACGTATTTA ATAAAATATA 7 92 0 

P CTTGGTGCAT GATGCCATGT ATAAATGGCG CTTGGGGAAG TACCCGTGTA CTAGTTGACT 7 980 

til GTTGCCCATC AGAAATGCCC AGGACCAGAA ATGTTCCAGA GTTTTCTTTT CTTTTAAATT 8 040 

254 CTTTTTGATT TTGGGATATT TGCACATAAA TAATTATATA TTTGTATATA AATAATGATA 8100 

SI TATCCTGGAA ACGAGCACTA ATTCTTTTGT TGCCTGTCTT CTGGGTTTTT TTTTTTTCTT 8160 

jtj TCCTTCTTTC TTTTTGTTCT TGGCCATCCT GGAGCTCTCT GTAGACCAGG TTGTGCTTGA 822 0 

j|l ACTATAGAGA TCCTCCTGCC TCTGCCTCCC ACATGCTAAG ACTAAAGGCA AGAGCCATCA 82 8 0 

CACCCATCTG TGAGCACAAA TCTTGATATT TCACCTTTGC TTTATACAGA TGGTTGTATA 834 0 

3jL GTCAGTCGTT GTATTCGATG TTTTTAATTC TACATTTTCA CTGTGACCTG CTACATGAAA 8400 

P TTCAAATACA AACTTGTCCA CTCACACAAT ATTGGCCCTC AAAAAGCTGT GAGCCTTTGA 8460 

H ACTTTTGGGG TTAAGAATGT TTAGCTTGTA TCCGTATTCT TCGCTTGTAA ACTCTCTTCC 8520 

fit TGTAATCACA TGAGTTCCTA GCAAAGAGGT GAATAGATAG CACATTGGGA ATCAGCATCT 8 580 

i» GTCTCTAAAT GGTCTTTGAA AGAAACTGTA GATACCTGCC TGGACCAGCC AGACCTGTGT 8 640 

3jS CTTAGCACCT ATTTTAAACA TTGTTCTACC TGAGTTGTAA GATGCAAAAC ATAGTGGGGC 8700 

fjj TCTGAGGGCC CAAAGGCCCT GAACAGGGGT GACCTCAGTT GTGTGGAATA GGGAGAAAGA 8760 

CAGCAGAAGG AAGGGAGGAA AGACGGGCAA GGAGGGGAAG GTGTTCATGT GTATGGCTGC 8820 

ATCTAAATAG AAGCCATGAA GACTAGCTAT TGTTTCTCAG GTCCTTCCAA CTTGCTTTTG 88 80 

GAGACAGGAA CCCTCACCAG CCTGGAACTT GCCAAGTAGC TAATTGGCTG GCTCTTGACC 8 940 

40 CCTAGATCTC TTTCCCCTCC ACTCTAACGT TACAACATAC AGCTCTCTCT CTCTCTCTCT 9000 

CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCATTTTAT TTTTTAAAAA AAATTTATTT 9060 

ATTTATTTAT TTATTTATTT ATTTATTTAT TTATTTATTT CATGGATGTA ATACCTGTCC 912 0 

TGTCTCAACC CCAAAATGGG CATCGGATCC CATTCCAGAT GGTTGTGAGC CACCATGTGG 9180 

TTGCTGGGAA TTGAACTCAG GACCTCTGGG AGAGCAGTCA GTACTCTTAA TGCTGAGCCA 9240 

45 TCTCTCTAGC CCTTTCCCCC TCTTCTAAAA CATAGTTTTT GAAGATCTAA CGCAGATCTT 93 00 

CAAGTGTCAG TATGGCAAGC ACTTTGCTGA CTCACCAGCC CATGACCTTC TCCCTTAATC 93 60 

TCCAAATCCT TTTAGTGGGA GAGACACAAT CGTTTTACTT TAGCCATTGG AAAGAGCTTC 942 0 

CTTCTAAAGC AGCTTGAAAA GCCATTGGGG TTTCCAGCGT GTGTGTGGCA GTGTT AC C AG 94 80 

GTTATTGTGA TGGGACAAGT TCTTATTCTC TTTCTTCTGA GGAGGTACCC TGGAGACCTT 9540 

50 GGGGAAGTGG GGGTGGTAGG GAGGTTTATG GCATTGGGGC AGGGAGTGAA GAAGAGATTT 9600 

ACTGCTGAGA GCAAAAGGAT TGTTAGATCC AACAATCTAA CAAAAAAGGT CAAACTTTTT 9660 

TTTCTTTTAT GAC CTTAGTT GTGATAACAG AAAAATAGTA ATGTAAGTGA TGTCCACTTC 972 0 

ACAGAATCCT CATAAGATAT TCAAGACCAT AAATGTGGGC CACTCTTACT TTGATGCCCA 978 0 

GTAGGGGGCC CCTGAGCAGA TGCAGCTTAG TTAATAGGAT GCTTGCCCAC CATGTTTTGT 984 0 

55 ACATGTTCCA CCCTCAGTAC ACAGCCAGGC ATCGTAGGAA AC AC TTGTAG CCCCTAGCAC 990 0 

TTGGCGGGAG GACCAAGAGT TCAAGTCCGT TTTTGATTAT GTAGTGAGTT CAGGGTTAGC 9960 

ATGGGCTATA GGAGACTGTA GAGGGCTATG TGATTAAGAA CAGATTTGAG CCCCACAGGG 10020 

CTCCTGGTGC AGCATGAGTT TGAGGAACTA GTGTGTATAG CATGCTTTTC CTTCTTCTTG 10080 

GTATGTCAAG TGACTTTCTA GACGCAGATG TGGCATCGAA CTAGAACTAA C ATTATTG GG 1014 0 
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GCCTCTTTGG 
GCATGGTGGT 
CCTGGTGGAT 
TCTTCAGTAC 
TCAGATAGTA 
AGAACCTGGA 
TTCCAAGCTA 
CTTTGGTTTC 
CTCCTTTTGA 
ACTGAACAGC 
CCACTGTAAA 
CTGTCTTGTG 
ATTGAGATTT 
CCCAACCTAT 
TCATCTCCTT 
GTGTTTTACG 
AAAATTTCAG 
AACTTCCAAA 
TCCTTGAGTT 
GACAATAAAA 
CTGCTTTGCC 
GGTTGTTTTG 
CTAAACTATA 
GGGGATGGGA 
CCACCTCCTA 
ATTAGTTAGT 
AACTATTACA 
GTGTGTGTCC 
GGTTTTCTGG 
GTGATGGGCA 
ACATGAGTGT 
TGATGTCCCT 
TTTATGTCAG 
TTATCTGCTG 
CCGTTCCCCA 
AAAGTTTCAG 
GTTGTCAAGG 
GAGACACAAA 
CCTGGAAAAG 
TAACTTAATT 
AGGCCTCTGA 
AATAT AC TTT 
AAAAAGAATT 
AGAGGGCCTT 
AGGACCTCTG 
TTCCTTTTTT 
GTATGGGCTC 
AGTCAGAGAC 
AACTCAAGTC 
TCCTCACTTT 
CACACAACAA 
TAGGATGACT 
CTTACCATAG 
GCACTCAGGA 
GGATCTCATC 
GCTGGAGGGG 
GTACCCCACA 
TTCAAGCCTT 
ACATAAAATA 



ATTGCTTACT 
AACAACTACA 
ACAGCTCAAT 
CTCGAATATT 
GAGTGTCTGG 
TGTGGTAGAG 
CATAATAATA 
AAACTGGGGA 
TGTCATGACT 
AC CAGAG AAC 
TCAGGAGTCA 
GGTCTTTAAT 
GAACTAGGGT 
GTGGTTGTTT 
CATCATGGGA 
CAATACCTGT 
GCTCTGAACA 
CTGCCAGACA 
CTCTGCCTGC 
AAACTTGCAA 
TCTGTGTGCC 
CAATGCTGGG 
TTCTCTAGTC 
ACCCAGGGAC 
GGACTTTGCT 
TAATTTAGGG 
AGCATGCTAG 
AGTATTGTTC 
TTGTAAAATC 
AAAAAGATGT 
ACATGCTTGA 
AATCCTTCTC 
CTACTGGGAT 
TCTTTGTCAT 
AACCCAAACA 
CTTCCTTGGC 
GCAACAAAGC 
CTTTGCACTT 
CTAATCTCCA 
GCAATTTAAA 
GTGACTAAAG 
ACACTTCTTA 
ATTTATTTTA 
AGATTCCATT 
GAAAAGCAGT 
TAAAAAAAAA 
AAACATGATA 
AACTTGCAGA 
ATCAGTTTTT 
ACCTTTTTAT 
CGAAAAGATC 
TTGAATTCCT 
AATTTAAGAA 
AGAAGCTAAG 
AAGAAATCAA 
TTGGTTAAGA 
TGGTGGCTCA 
CTCAGGCAGG 
AATAAATTAA 



GAGCTGCAGC 
CTACAGAAGA 
GGGAGAACAC 
GGCCAACTAA 
CTAGCATTCA 
TACATCTATG 
CAAGACCAGC 
CAGTTTTCCC 
AGGAAATGGT 
CAGCCAGTGC 
GAACAGGACC 
GGCATTGTGA 
CTTGTGCATG 
TGTATCAGGG 
CTACATATGT 
C CTGATATTT 
GAAATGAAGC 
GAAGCAGAAT 
CTCCTCTAAA 
CTTTTTTCAG 
TTGGTTTCTC 
GATAGAATCC 
TTAAAAGTAT 
TGTAGCTTAC 
GTTTATTTAT 
GATTAAATGA 
TATCATTAAT 
TGGGAACCCC 
TGCCAAGAAC 
AGTGTGTGTG 
GCCCTGAAAC 
CAATACTAGG 
TCAAACTCAG 
TATCAGCTGG 
TTCCTGGGCA 
TTTGACCAAA 
CTCAGGGAGC 
GATCTTGTTT 
CGTCTTTTCA 
AAGAGGCAGC 
AC CATTTGAA 
ACTATGTGCC 
CGTATTTGAG 
ACAGATGGTT 
CAGTGCTCTT 
ATAAATGAAA 
TATGTGCATG 
AGATGGTTTG 
GTGCCAACCC 
TTAAAAAACA 
TCATGTAGCC 
AATGTTCTTG 
ATTATCTAAG 
GCATGATGAT 
CATTCAATTT 
GCACTGGTTG 
CAACCATCTG 
TGCATAAGTA 
AATTTTGAAA 



TTTGGCTCCA 
CTACTACTTT 
ATATTGAGCA 
AAGGAATGAA 
CAAAGCCCCA 
ATCCCAGCAC 
CTGGGCTACA 
TCTGGGAGTG 
GGATACTGGC 
CAAGGCCAAT 
AGGAGTTAGA 
TTTTGGTTCT 
CTAAGTAAGA 
TCTCTCCTTG 
GAGCAGTTTT 
CTTGCTGTAT 
AAATCTTCTG 
GTGTCCTCTG 
TTCTAAAAAA 
AAGCCACAAG 
ATTGGTAAAG 
AGGGCTTAGA 
TGTTTGCATT 
TAAGTGTTCT 
TTATTTAGTT 
GAGAGTAATT 
TTGTGGGACT 
TCCTTCCCTG 
TGGAATGTCC 
ACATTCATGT 
AGGATTTCTC 
TTGTAATAGT 
GTCTGGACAG 
GTTTAAAAAG 
CCAGGGTTCC 
GCAGTCTTGT 
AGCCAGATGA 
GTGCTTTTAA 
TTTTTCTGTT 
TTGCTGTCCA 
ATGGGTCGTC 
TAAAGTATGT 
TACACTGTAG 
GTGAGCTACC 
AACCACTGAG 
AATTAACTTT 
GGCTCACACA 
CTCTTTTCAT 
CCTTTACTCC 
AACAAACAAA 
CCAGGGTGGC 
CCTCTACCTC 
GTAAAGTGGT 
TGTGAGTTTG 
TCAATTATTT 
GTCTTCCAGA 
TAATTTTAAT 
CACAGTCATA 
GTTTTTTTTG 



AGAACTTATT 
GAGACCAGCC 
TGTACAAGTC 
TTTAGGGGTG 
AGTTTGACCT 
TCAGGAGAAC 
CAAGATCTTA 
ATATCTAGCA 
ATAGAGTGGG 
AGTACAGGGG 
AAACCAAATG 
AGTCATCATT 
ACTCTGCCAC 
TAACCCAATA 
ACTGTTTTTC 
TGTCACTGTC 
AC AG TAAATG 
TATGCCTGTA 
AGAAAGAGCA 
ACTGTAAAAG 
GAATGGTAAC 
GTATATTAGG 
GTTACTGTGT 
GCCTGTGGGC 
TAGGGCTTTG 
ATTACCTCAT 
CTGAATTCTT 
CAGGTTCATA 
TGTCTAGGCT 
GGTGGTGCAT 
ACTCAATTGC 
ATACATGGCA 
CTGTTATTGT 
TATCCTTGAT 
AAAGCATTCA 
GCTTCACAAC 
CCTCACTCCG 
GCCCCGTTTT 
GAACCTTTCG 
GGAGGAATGA 
ATCTATTACA 
TTTATTTTGT 
CTGACTTCAG 
GTGTGATGGG 
CCATCTTTCC 
TATTTCATGG 
TGCAGTGGTG 
CATATGGGCC 
CCGAGCCTTC 
CAAACACCAA 
TTTGAACTCC 
CTAGTTACTA 
GTTATGTGCT 
AAGCCAACCC 
CTTAAATTTT 
GGACATGAGT 
TCTAGGGATC 
CATGCACAGA 
GGTGGAAGGT 



ATGGAGATGG 
TGTACCAGAG 
CTGAGTTCGA 
GGAATAAAGT 
CCAGCACTCC 
TTCAAAGTTA 
TCTCAAAAAG 
GTGTCTGGAC 
CTGAACTCAC 
CTGAGAAAAT 
TTACTTCAGC 
TCTTTTCGGT 
TGTGCCATAT 
CTCAAACCCA 
CTTCTTCCTT 
CCATCTTTTG 
GAGTTCCCTG 
ATTTTTTCTG 
AAAACAAACA 
GACCAACAAA 
ATCTTTCCTG 
TTCCCTGCCT 
TTTATGGTGG 
TATACCCTAG 
TTATTGATTT 
ATGGTTTAGC 
TCCAAGGCAA 
GGAGCAGAGT 
CTGCATCTTA 
GCATGTGTGT 
CATCAAGCTT 
AGGCTAGCTT 
CAGCTGAGCC 
CCTATTCTCA 
GTGTGGAACC 
TGTCATAACT 
TTTTTGGCCA 
AGATGAGGTT 
TGATGCTTTC 
CACAAACACT 
GAAAATGTAA 
TTTCCTCTAA 
ATCCACCAGA 
AATTGAACTC 
GGCCTTTATT 
GTG TAT AT AT 
CATGTATAAA 
CTGAGGATTA 
TCTCAACAGC 
CCCAGCCTCC 
CCATATAGCT 
TGCCTGGCTT 
TATAAGCCAG 
AGGTTACAGA 
TTGAGGTTGG 
TTGATTCCCT 
TAACGCCCTC 
AAACACATAA 
ACTTTTAAGT 



10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
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a 
m 

25,j 

SI 
N 



AACATTCTAT 
TGTTTGTTTG 
TGTGTAGCGA 
TTAATATAGG 
GTGTATCTGT 
GAGAGAGAGA 
CTTCTACCCT 
CTGAGAACTT 
TTTGAATCCC 
CCTAAGCTGG 
CAGATGTGAG 
CTTTTCTCTG 
ACTCTCCAGC 
AGAGCGCTTA 
CCAAAAAAAA 
ATGAAAACCT 
ATAACC CCAG 
CTAGTTTAAT 
ATCTGT CAT C 
ACATATTTAT 
GCATACACAT 
ATAGTATACA 
TGGACACATG 
AGTCCAGGTT 
TTGCCTTTGA 
TCATCCGGAA 
TTGTAAAGCT 
GGCGTTTTCA 
TTCAAGGATA 
TTTCCCCAGC 
TTTTTTTTTT 
GCGCTTCCTA 
TCTTATAAAT 
TTATGGCTCA 
ATAGTCATGG 
CCACATCTTT 
GTGGTTCTGT 
TTGGGTATCT 
AGAGGCAAGT 



GTTATGGAAC 

AGGATGACCT 
CAAAATAAAC 
GTGTGTGTGT 
GAGAGAGAGA 
GTGGGTCCCA 
CTTGCTGGCC 
ATCTGAAGCT 
TTTAAAACTC 
CCCCAAGCCT 
TGTGTAGGTC 
TATTATTCTT 
CCTAGGAAGC 
AAAAAATGAA 
GATTCAATCT 
CACTGGGGAA 
TGGTGAGCTC 
AAGACCTGGC 
AAACAAACAT 
TAAAACACAC 
ACCATGCATA 
AACACATACA 
TCAAATATTT 
ACAGCGATCT 
ACAACCCCCA 
AAACACATTG 
ATCCAGTTAT 
TTTTTATTAT 
GGCTAATTTA 
TTTTTTTTTG 
GGTAAGCGCT 
GTCACTATGC 
ATACTGGTCT 
GTGGTATTTC 
AGATGCACTG 
GTTCAAACAT 
TCTAACTGGG 
AGATCCGAAT 



AAGTGCATTC 
GGAACAAGGT 
TGAATTTTTT 
TTTAAACTTT 
GTGTGTGTGT 
GAGAGAGAGA 
GGGTAAACTC 
TCACTCCCTA 
TGTTTTTGTT 
ACAGGAATTA 
GAGTGCTTCT 
TGATTAGTTG 
AAAATGAAGG 
GCAAGGCCCT 
GGGTCTGGTG 
GTAAAGCCCA 
ACAGAACAGG 
CAAGCTCAGT 
CTCCATACAT 
ATGCACACAC 
ACACACATAT 
AATGCACAGT 
TAT ATGC TAT 
ACAAACAGAA 
CTCGACACCT 
TGCACTCTAT 
TCTACTGCTA 
TTTTAGTGTG 
CCCCCAAATA 
AATTGCT TTC 
GGTTCTTTTT 
CTACCACTGA 
TGTATGTGTT 
ACTTATGGAT 
TACTTTGGGG 
ATGCATTCAT 
CAGAGGCACC 
TGGTGGTGGT 
TCTCGCCCTA 



AATTTTACTA 
CTTGTGTATC 
TATACTACTG 
GTTTGCTGTG 
GTGTGTGTGT 
GATTAGAGAA 
GGGTTATAAG 
TTTTATTTTA 
GTTTGGTTTT 
TCCACCTCCA 
GAAAGCTGCT 
TGGGGTTAGG 
GTCTGGGGGC 
GGGTTCGGTC 
GCTGAGGAAA 
CATAAAAGCC 
AGAATACCAA 
GAGACCCTGT 
ATATGCACAC 
TTGTGCATAC 
ATATATACAA 
CACACATATG 
ATCTTATATT 
AAGCGGGCTA 
GATCCCCGCA 
TGATTTTAAT 
AATACTTCAT 
TTCTTAGATT 
ACAGTATTTG 
ATGAATAGCC 
TTCGGAGCTG 
GCTAAATCCC 
CTTTCAGCAT 
CTACCACACT 
CTATTATAAG 
TTATCCTAAG 
ACCATTTATT 
ACATGCCTGT 
TAGTGAGTCG 



AGTTTTTAAT 
CCAAGCATCC 
CCTTCTTGAG 
CAGGTATATA 
GTGTGTGAGA 
TAACTTGTGG 
GCTTTGCACC 
TTGGTGGCAG 
TAAGGCAGTC 
CCTCCCAAGT 
TTTTTTTATT 
TGGTGTCAGC 
TGGGGATTTA 
CCCAGCTCCG 
AAGCTCAGTT 
AGGCATGGCG 
GAACTTGCTG 
CTCAAAAATA 
ATGTTACTCC 
ATGAACAGAT 
AAGTGTGTAC 
AATGCATTCA 
ACACTCCATT 
CTACCTGTAC 
GTGCTCCCTG 
AC TGGGGATT 
TCTTTGCCCC 
TAAGCATCCA 
TTAGGTGTAA 
TATTCTGGAA 
GGGAC CGAAC 
CAGCCCCAAT 
TGCAACACTT 
ATCTATCCAT 
CTTGCTAGGA 
AACAGATCCT 
TTATAATAGG 
AGTCCCAGCT 
TATTAGTCGA 



TTTAGCTTTT 


13740 


TCAAAGTTGT 


13800 


GGCAAGCATT 


13860 


TGGTGTGCAA 


13920 


GAGAGAGAGA 


13980 


AAGTTCTCTC 


14040 


CTTTTTCCCA 


14100 


TACTATTGCT 


14160 


TTAACTGTGA 


14220 


GTTGGGGTTA 


14280 


TCAAAACTAT 


14340 


ATGATCCATC 


14400 


GCTCAGTGGT 


14460 


AAAAAAAGAA 


14520 


GCAAAAAAAC 


14580 


GCATGCACCT 


14640 


GTCAGTCAGT 


14700 


AATGGAGATG 


14760 


CTCACATGAA 


14820 


ATCTATATTG 


14880 


AAACATAGGC 


14940 


TATTCACACA 


15000 


ACTATCCCCC 


15060 


TTTTTCCCAA 


15120 


CGGCAGAGCT 


15180 


ACCTGGAGCC 


15240 


TTTCCCATGG 


15300 


CTAGTACAGA 


15360 


CCTTGTAGTT 


15420 


AAGTAATTTT 


15480 


CCAGGGCCTT 


15540 


TCTGGACATT 


15600 


TGGTTCCTTT 


15660 


TCATCTCAAC 


15720 


GTATTTATGA 


15780 


GGATCATATG 


15840 


CATTTAAGAT 


15900 


CCTGGGAGGC 


15960 


C 


16011 


+11,795 (1st 


intron) 
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SEQIDN0:17 

The 5' (-5086 ) and 3' limits of the Human SM-MHC Promoter-Enhancer LacZ 
Transgene Tested in Transgenic Mice 

The number in the left margin refers to the position within an undefined BAC sequence contained in the 
public database (Accession # U91323 in GenBank). The start site (i.e. +1 position) of the SM-MHC gene 
corresponds to the BAC position 143,590. 

-5086 TTTAAA ATTATTAAAT CTTCTTTTTT TTTTTTTTGA 
13 8541 GATGGAGCCT CTCTCTCTAG CCTAGGCTAG AGTGCAATGG TGTGATCTTG GCTCACTGCA 
13 8601 ACCTCCACCT CCCAGGTTCA AGGGATTCTC CTGTCTCAGC CTCCCAAGTA GCTGGGACTA 
13 8 661 CAGGCGTGCA CAACCACACC CGACTAGTTT TTGTATTTTT AGTAGTGATA GGGTTTTACC 
138721 ATGTTGGCCA GGCTGGTCTC GAACTCCTGA CCTCAAGTGT TCCCTCCACC TTGGCCTCCC 
13 8781 AAAGTGCTGG GATTACAGGT GTGAGCCACT GTGCCCGGCC AAAAAATATT AAATCTTGAG 
13 8 841 GCACATGCAG GAGTAAGCCA TGCTCAGACC CAATCTTCGA TGTTACTAAA AATTGGAGGG 
13 8901 GATCACACTT CATGGTTTTG TTTTGTTTTG TTTTTTTGAG ACAGGGTCTT GCTCTGTTGC 
138961 CCAGGCTGGA GTGCACTGGT ACGATCACAG TTCACTGCAG CCTCAAACTC TGGGGCTCAA 
139021 ACAATCCTCC TACTTCACTC TCTAGTTGGG ACTACAGGCA CACACTGCTG TGCTCGACTA 
13 9081 ATTATTATTA TTATTATTAT TATTATTATT ATTATTATTA TTATTATTTT GTAGAGACAG 
139141 GGATCTTGCT ATGTTACCTA GGCTGTTCTT GAACTCCTGG GCTCAAGCGA TCCTTCCGCT 
13 9201 GCAGCCTCTC AAAGTGCTAG GATTACAGGC ATGCCCAGCC ACTTTGGGGC TTTTTTAAGC 
13 9261 CAACAGCAAA AAAAGACTAT AAGAGAGAAA TTTCCCCTTG GCTGTCTTGT TTCATGGATT 
139321 CGTGGAAACT CCCATTAAAC AGCCGGTCAC AGAAAAAGAT ATGCCAAGGA AAATTACTTG 
139381 ACAGCACTCA GTCAAAGTGA CATTTTAAAA AGAGACTATT GCCTCCTCCA TCTTAAAAGA 
139441 ACTGACCTTT TGAGCCATGA GAAATGAAAC AGAGGCATCT GAT CGAATGA TAACAATGCA 
139501 CTTCTGAAGA TTCAAACATC GGAACTTCAT GCATTGGACA CATATCTATT GAATGACTCT 
139561 TAAGTGAACA TACTGTCCCT GCCTGCTTCC AGAGGGTACT AGAGAGGTCG GAGATGGTTC 
139621 ATAAAGGCCT TCACATGTGC TGTCATATTT AACAATCAGA AAGGTACTTG AGGCAAAGAA 
13 9681 TCTGATCATC TTTGTTTTTC CTTGAGAAAA TGCGCTCAGA GAGGTTTACT GACAATCCCA 
139741 AAGGTGCTTG GTTGGTGCTT AAGAGATCTG GGTTTAAAAC CTCAGACTGC TGTCTACTAT 
13 98 01 GGCCTGTGTC AGAAAGACTG GGGTTGGAAT TCCTGTTCCA CCACTGCTGT GTTATTTAAC 
1398 61 CCCTCCAAAC CTAGATTCTC AACAATAAAA TGGGGGTAGG GAGGGAATTA AAGTATGTAC 
139921 CTTATTTTTT AGAGACAACA TCTTGCTCTG TCGCCCAGGC TAGAGTGCAG TGGTGCAATC 

13 9981 ATAGTTCACT GTAGTCTCAA CCTTCCAAGC TCAAGAGATC CTCCTACCTC AGCCTCCCTA 
140041 GTAGCTGGAA CTTCAGGCTA CACTACGCCC AGCTGCTATT TATTATTTAT TTATTTATTG 
140101 AGATTGCATC TCACCATGTT GCCCAGGCTG GCTACTTAAA AAAAATTTTT TTTTTCAAGA 
140161 CAGGGTCTCA CTCTGCCACC CAGGCTGGAG TACAGTGACA GAGTCTCAGC TCACTGCAAC 
140221 CTCTGCCTCC CAGGCTCAAG TGATCTTCCC ACCTCAGCCT CCCAAGGAGC TGGGATTACA 

14 02 81 GGTACCCACC ACCACACATG GCTAACTTTT TATTTTTTGT AGAGACAGGG TCTTGCTATG 
140341 TTGCCCAGGC TGGTCTCAAA CTCCTGAGCT CAAGCAATCC TCCTGCTTTG GCCTCCCAAA 
1404 01 GTGCTAGGAT TACAGTTGTG AGCCACCATG CCTGGCCTTG GCCACTTTAG TTTTGCTTTT 
140461 TTTTTTTTTT TTTGAGTTGG AGTCTTGCTC TGTCATCCAG GCTCCCAGGC TGGAGTGCAG 
14 0521 TGACACAATC TCAGCTCACT GCAACCTCTG CCTCCTGGGT TCAAGCAATT ATCCTGCCTC 
14 05 81 AGCCTCCCAA GTAGCTGGGA CCACAGGTGT GCACCACCAT GCCCAGCTAA TTTTTATATT 
140641 TTTAGTAGAA ATGGGGGTTT CACCATGTTG GCTAGGCTGG TCTTGAACTT CTGACTTCAA 
140701 GTGATCCGCC TACTTTGGCC TCCCAAAGTG CTGGGATTAC AGGCAAGAGC CACCGTGCCC 
14 0761 GGCTGCCTAC TTTAATTTTT AATAAAGGGT TGTTATATAA GGGGTAGGTG AGAGAATGAA 
14 0821 GTAAAATTGA GTGTTACAGT CTCCAGTTGT TAATCACATT ATAATTATTC TCTTTTAAAA 
140881 GTTACCAACA AGTTATTTAA AGAATCGAAT GGAACCCTTT GGAAATACAG TGTTCATGCC 
14 0941 TCTAGTATTA ATGCCAGTTT TTACTTCGAG GCCAGCAAGC TAGATTCCGA TGGCCTTCCC 
141001 TTTCCAGGAT GGGAAGCGGA TGATTGACTT CAATTTTCCC CCTTCCGTTA CTTCTCTGCT 
141061 CCACATCATT TCTGTGCTGA TGCAGGGACG ATTTCCACTC CTTTTACAGC GTAGATGTTA 
141121 AAAGCCTGTG CGGAGCAGCT CATTCATCAT TTTCCGCAGA GCTTTACCCC TCACTTCCCC 
141181 AGCCAGCTAA ATGCAGGCTG TTCTTGACTC TCTGATCTAG GCCCATTGCA GGGTGAGGGC 
141241 CAGGCTCAGG AGTTTCCAGG GTGAAAACCA GGTAAGCTTG ATGTTGGAAG GATGAAGAAG 
1413 01 GACCCAAAAG GGTCTGAGAT GCAGAGCTCT CCAGATGGGC CTGGGAGCCT GCAGGGGAAG 
141361 AGGCCTCTCT TTATATCCCG GAGGCCTGGT GCAACTC TAG TTGGTTTCAT GTTTGTTGCG 
141421 AGTAACAGCA GCTCACATGA AGCGGTGCAC CATGTTCATT TTACATGGAT TCATCTCAAG 
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141481 GACTGCTTAC AAAAAGGCCA GGAAGTAGCT GATGTTCTTC CCATCTTACA GGTAGGGAAA 
141541 TTGAGGCATG GAGAGGCAAA GTTACTTGCC CATGGTCATA TAGGTAGAAA GCAGCACTGG 
141601 CAGATTCAAA GCCAGACATC TACTCTCAGA TACACGCCCT GGGCCTCAAG GCCAGTTTGC 
141661 CTGGGCATTT CCCTTTAATG TCTCCTCTCT GGAAGTGAAT GGTGTCATCA GAAAGGTTCC 
141721 AGTGCCAGCA CCAATCAATG ACTGTCCCAG TGAGAGC TTG GTCAAATCCC TTTACCCCTG 
141781 CAGGGACTCA ATTTTCTCAC CTGCAAAATG GGGGTATTAA TAAAGCCACC CCCCGCACCC 
141841 CCGGCCCCCA GCCCCTCCAC CTGGTTGCAA GAGGAGTGGT TGTAGACTAA GGGCCTGCGT 
141901 CAAGTACAGA ACCCAGGAGG GGTCTGCCCA ACTTTAACCC TCTCTCCAAA TCCTCTAGCC 
141961 TGAAGCAGCA GAAACCCACG TGGGACTGGG GGCTGCCCCC TTCCGGGCCT TCCCCAAGCA 
142 021 GAGGGGTCCC CATCTAGCCC CGCGGGGCAA CGGCGGCCGG TGGCTGCGTG AAGGGCCCCC 
142 081 TCCCCCGACG CCGGGGAGCA GGAAGGCCAC TCGGCACCAT ATTTAGTCAG GGGGAGCCGG 
142141 CAGCCCAGAG CTGGTATGCG GCGCTGGGAA TTCCTGCAGG AAGGAGTCCG CGCCTGCCCT 
142201 TTTTGGGTTG TCTCCCGCCC GCCGCTCCCG CCGCTCCCGG GGAGGGGGAC CGGCCCGGCC 
142261 CGGCCCGGCC CGGGAACCTC GGAGGAGCTG GTGCCGCGCG GGGAGCGGAG CGCCCGGGCT 
142321 GCCCGCGGGT CCCCGGCCTG GCGCGGGGCC AGCCCACCGC CTCGACTTCC TTTTATGGCC 
142 3 81 TGTGTGTGCG TGCGTGGACA GGAGCGGGGA GGGAGGGACG GGGAGAAGAC GGAGAGCCTG 
142441 GGGAAGAGAG AGAGAGAAAG CGCAGAGATA GGAGTGAGAC ACGCGGGAGA GATGGAGAGC 
142 501 AAGAGACACA GAGACCAGAG ACAAAGTGAG ACAGGAGGGA GAGACAGATA CATCGACAGA 
142561 TCTAGAGAAG CGAGAGGGAC AGAGACAAAA GATAGAGCGA GAGACAGCAA TGAT CAGAGT 
142621 GACAGACATG CAGAGACAGT GGCAGAGACA GAGCGAGAGA GCCTGTGATG GAGAGAGACA 
142681 GGGAATGCAA TTTTAGGCGA GGAATCCTTG GGGAAGGGAA GTTGTTGAAG GGAACTCGCA 
142741 GACTCTGGGG GCACACCCAC TTTCTCCTTG GATCTTGACA CTTGCATCTT GTAAATAACG 
142801 TAATTATCAC CGCCACCGCC TTCCCCCATT TTGTAGCTAT GGACACCAAG TCTCAGAGAA 
142 861 GTGAAGTGAC TTGCCCAAGG TCACGCAGCT GGCGAGTGGC GCACAGGGGA GGGGGACAGC 
142921 TGAAATAATC ACAGTGGGCT TATTTTTAAT TTTTATTTGT ATTTTGGTCG TGGTGATGTG 

142 981 GGTGGAGGTG GAGATGGCAA GTTGGGAAAA GTAAAAACTT CCCCTTCCTG CACGGTTCCC 
143041 AGCAAGGGTG GGGGCCTCCT GTCTTGCACT TTGCAAAGTT CAAGAAATCC CCTTTCCCTA 
143101 CCCTTCACGC TGCACAGCCG GCCCTCTTTC CAGACAGTGC GATGCCAATA AAATGGGAAG 
143161 TGGGGTGGGA GATGTCAAGT CAGATCCACC ACAGCCCCGA CACGGGGAGG AAGAGGTTAA 
143221 AGC CTTTGCG GCCGGAACCG ACTCAGGGAA GACGTTCTCA AGCATCCCGC ACAGACACTG 
1432 81 CCTGCTCGAC CCCCTTTCTC TAGGGATCCG GAGCGTCTGC GACCGCCTGG GGCCGGGGCT 
143341 GAGACTCCCG TCCCTGTGCG CACCTGTTCC GTGCGCCCTT GTGCGGTGCG CACCTGTTCC 
143401 GTGCACCCTT GTCCCGAGCG CCCCAGCTCC TTGCGCTCCC GCCGGGGGTG CGCCCTGCAG 
1434 61 GGGGCGCGGC GAGGGGGCCG CGAGGGACCC TCCCCAACTC CACCCCTTCG GCCTCCTCCC 
143521 CTTTCCCAGC CGCGGGCAGC TCCGGGTCTA TAAAGAGAGG CGTCCGAGGA CGCGCAGGGA 

TRANSCRIPTION START SITE +1 

14 3581 GATTTGGACG EtCCGGCCTG GGAGGTGCGT CAGATCCGAG CTCGCCATCC AGTTTCCTCT 
143641 CCACTAGTCC CCCCAGTTGG AGATCTGTAA GTAGTAGTTG TCATTCTGGG GGCAGATTGC 
143701 AGGGCAGGGG GGTGTTAAAA GTCCTATAGG GTATTCTATA GGGGCTGGGG TGCACTTAGG 
143761 GGTCCCTGTT GTCAACCTCG TAAGGGCCAT GGTGGGGGCA GAGTTGTGAT TTGGATCTCT 

143 821 CTCTGCCTTA TCGTCTTAGA TTATCCTAGA CTTTCCCCAA ACAGCATTTC TTAAGATTGC 
143 881 CAGTGAGAAG TACCATTTTG GGGGTGCTTA TTAACGATAT CAATGCCTGG ACCCAACTCC 

143 941 ATTTCCCAAC TCTAGAATCC CCAGAAAAAC TGCCTTAAAA AAAAAAAAAT TAGTCCCGAG 

144 001 TGATTCTTGT TAAGAGGCTA ATCCAGGAGA TATGCTCCCT TGGAAATCTC AGAGGTCCGG 
144 061 TGCAGACAAT CAAGGCATCT CACTTTTATT CTAGGCACCA AAAAATTTAC AGCTGAACTT 
144121 CACTGAAAAG TCACTTGCTA TCACACAGAA GGGCAAAGTG AGGCTCCTTG TGGATTTGAC 
144181 CGTATTGCAC AGTTGTGTTG ATAATGCATT AAATCAGTTA AAAACACATG GGCATAGGCT 
144241 TAGCAGAAAG GAGTGTTGTT GTTTTTTTTT TTTAATCAGT TTAGGGGAGG TTCTTCTATG 
1443 01 TTGAGAACCC CTGGGAGATA AGGCTGGTTG TGATCTAGTT TGTTAC AGC C CACTTTTTCC 

1443 61 TCTTCTCCAA ATTAAAAAAA AAAAAAACAA CTCACCCAGG TTGACCCCAA AGGGCCCCCA 
144421 GATACCCAGG TGGGCTCCAA AGTCTCCATT TGCTTCCACG ATCTGCAGGT G CGTTAGGT A 

1444 81 AGATTACACT AGAATTTCCC GCAGAGCCAC CTGTGTCAAT GCCACTCTCG TGCCCAACCA 
144541 AATGGGTAAA ACGAGAGAAA GTGTGGCTAC TGC CTGTTGT AAGTTTTCTT CCAGCACAGG 
144601 GTCTGGTAGG GATTTTGCCA CTTGAGAAAA GGTACCATCC AAAGCCATGC TTGT CAAGAA 
144661 GTAAAAGAAA ATATTTAGAA ACCCAAGGTG GGAGTGTTTA GTTGCAGTAT GAAGAACTGA 
144721 GAGATTAAAT GGTGAACTGT CCGTCCGGGG TTTGGCAAAA AGAATGCAGG CTATTAATAA 
144781 ACTGCTTTGC ATAGTTTTTT GTTTCTTTGA TTTACTCAAC GATACTATTT TAGAATTGTT 
144841 CAGAGACGGA ACTTGACGCT GAACTGAAAG T CATTAGGTG GCAGGGTGTG AAATAAGATA 
144 901 GAGAATTTTG TTTGAAGGAA ATTGATGTTT TCCCTTTGAG ATAGCTACCG TTGATGGAAC 
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144961 ACTTCAGTGC CACATGCTGT TGCAACATTT 
145021 CAACTTCATA AGAAAGGCTT TATGATGCCT 
145081 GAGGTAAAGT GTCACACAGC CAGCAAGTGG 
145141 AAGTTCATAC CCTTGACCCC ACTATCTTTC 
1452 01 TCAGGTTTCT GGTGGGACTA AATGCTTCCA 
145261 GAAGAAAACA TAAACTGACT TTTTGCACAT 
145321 TTGAGGGGGT GACCTAGCTG AAACCAGTGA 
145381 GAACGTGAAC TAGCTGGGTC TTTTCCGAGA 
145441 TTTTTTTATT ATCATGTCCG AAATAGATGT 
145501 TTGTTATCAT TACTTTTCCG TGGGAGGGCA 
145561 AGCGTTTTCC ATGAGGCCAT CCCCGGCCCC 
145621 CCGGTTGGTC CTGGCCCTGG GGATTTGTCT 
1456 81 GGAGAAACCC TAAGGAGAAG AGATTTGGGA 
145741 CTTATTCAGA GGTTTGATTT TTTTGGGTGG 
145801 GGTCTTGAAT CCAAACAGGT GGGTCTTCCA 
145861 AGATATATAG TAGATTTTTC TTGAAAACCA 
145921 AAGGC CAT AA GCAAATTTAA AAATCATCTC 
145981 CGGGAAATGG GGCTAATGAG AATAACTCAT 
146041 CTAAACGCTT TGTGGACATT ATCTTACGTC 
146101 TATTCTAACT GGCTTTATTT TACACATATG 
146161 GCTAACCAGT AAGGAAAAGA AGATTCTACA 
146221 CAGATTACCC TCATGGGAGG ATTTGATGAG 
146281 CTGGCCCCTG GTAAGCTTCA GTGATGGTTA 
146341 ATGTTTGCTC AGTATGGCAT GGCTCATCTT 

1464 01 CCGTTAAAGG AGAAGCTTCT CTTAATTTTC 
146461 TTTGGGCTGA AAGTGGTGAT TAGAATCTTT 
146521 TTGGGAATGT TACAAATTCC AATTACATTT 

1465 81 CAGAGTCTTG CTCTGTCGCC CAGGCTGGAG 
146641 CTCCGCCTCC TAGGTTCAGG CACTTCTCCA 
146701 GTTTATAGGC GCCCACCACC ATGCCTGGCT 
146761 TTCACCATAT TGGCCAGGCT GGTCTCGAAC 
146821 TCTCCCAAAG TGCTCAGATT ACAGGCGTGA 
1468 81 ATGGCATACT TTAAGGGATG GCCTTTTTGC 
146941 TTTCAACTTT TAACCCTGGC CACACAATGG 
147001 AAGAATTTGA AAGAAACCTT GTATGTGATC 
147061 GCTGAAAGGG GAGGTCGGAC ATCTGTGACT 
147121 CAACCTCATC AAAGTTGCTA GGAAAGGGCC 
147181 TAATCCCAGC AATTTGGGAG GCTGAGGTGG 
147241 CAGAGTGGGC AACATAGCGA GGCACCATCT 

1473 01 ATGGTGGCAT GCATCTGTAG TCCTAGTTAT 
147361 TTGAGCCCAG GAGTTCAAGG CTGCCGTGGG 
147421 GCAACAGAGT GAGACTCTGT CTCAAAAAAA 

1474 81 GGCTGAAAAA TACAAGGCCC AGAGAGGG C A 
147541 TTGGGCAGAG CTGAGACTTG TAACTCGAAG 
147601 GCTTGTTTGT GCTCAAGGGA TGAAGCAGTG 
147661 GACATATTCA GATGTGTCAA ACATCGCTAA 
147721 AGATCGGGAT TCTAATCCTG GTTCCCCAAC 
1477 81 TCAAGACCCC GATAGGCTTA GCAGGAAAAT 
147841 CTTGGACTTT TTGTCTAATT ATATGAATTT 
147901 TGCTGAGTTA AAGGGCATGC GTATCTAAAA 
147961 ACATTGCAGA CCAGGTGCAG TGGCTCACGC 
148021 CAGGTGGGTC ACCTGAGGTC AGGAGTTCAA 
1480 81 ACCCTATCTC TACTAAAAAT ACAAAAATTA 
14 8141 CAGCTACTCG GGAGGCTGAG GCACGAGAAT 
1482 01 GAGCCGAGAT CACACCACTG CACTCCAGCC 
148261 AAAAAAAAAA AAGGCATTGC AAATTGCAAC 
148321 ATCCTCTTGG CACCAGAGAC TGGTTTCGTG 
14 83 81 AGGATGGTTT TGGGATGAAA CTGTCCCACC 
148441 AGGAACGTAC AACCTAGATC CCTTGCAGGT 



AACTTAATTT ATCTCATTTA ATCTTTGCAA 
GTTTAGTATA CAAGGCAGCT GAGGCTCAGA 
TAGAACCCAT TCCCGGGTCA GTTTGAGTCC 
TTCTTTACCA TGGACACAAA CTTGTTGGGG 
ACAAAGTAAA TGTTTATCAC CGTGTCCTTT 
TTAAAATAAA AGGCACTGTT TGTCCCCTGA 
CCCTAGGTGG GCTGCCATGC CGAGAGTCCA 
AGCCGCCAGG CTTGCCTTGT AAACACCATG 
GTTATTCCGT ACAAGGTATC TGTTATGGAT 
GAGATTGAGG CAAACATGCC CATTTATGGA 
CTCGTCAGTT ACCCAGCCTT GCACCGCAGC 
ACCATGTCCC TCACCCATTG AAGAACTAGT 
GGAAAGTGGG ATTCTTTTTT CCTACCCCCT 
GGGGTGGGAG GGAATTGTCT CCTTTCCACA 
CGTTAGGCAC AAGCGTGTAA TTCCAAGAGC 
AGTTCAATAT TCAATCCAGT AGAATCATAG 
CCGCACCTCC CCAAACCTCA CTTTCTCATC 
GTTTTTTGGG CACTTTTGCC TGGCGAGATG 
TTCATAACAA CCCTTTAGAG TAGATACTGT 
GAGTCTGAAT AACTTGCTTA AGATAGCTCA 
AATCTAGGTC TTTCTAACTC CAGAGTTT C A 
CTAATGTGTA TGAAGGGTTT AGCACAGTGC 
TTTATAGCAA ACACAAC CAG AGAGTTCAAG 
TGGCAGAACC GGGAAGCCTA AACTATGTGG 
TTCCCTTTGA TCTCATAAAC CTCGTTTCTA 
AATATATTAA GCTACCATTC CTTACCTGGA 
GTTTAGGGTT TTGTTTGTTT GTTTTTGAGA 
TGCAGTGGTG CGATCTTGGC TCACTGCAAC 
GCCTCAGCCT CCTGAGTAGA GAGTAGCTGG 
AATTTTTTGT ATTTTTAGTA GAGATGGGGT 
CGCTGACCTC AAGTGATTCG CCTGGCTTGG 
GCCACCGCGC CTGGCTTATT TAGGGTCTTG 
TCTCTAGGTC TTCTCCTTCC ACTCCTGACC 
AGGAAAGACT GAATTTAGAG AAAGGCAGGC 
CAAGGACAGA GGAAGAAGCT GCTCACAGTG 
TGTATCAGGG TTTCAGGGGC TAAGGAGGAA 
ATAGAGGCCA GGTATGGCAG GTCATACCTG 
GGGGATGGCT TGAAGTCAGG AGTTTGAGAC 
CTACAAAAAA ATTTTTAAAA TGAGCTGGGC 
TCAGGAGGTT GAGTGAGGCA GGAGGATTGC 
CCCTGATTGC ATCACTGTTC TCTAGCCTGG 
AGGTGAGGGG CATAGAACTT TACTGTACCA 
AGTGACTTGC CTAGC AT C AC CCAGCGAGTT 
ACCTAAGGAT CTTCCACAGG CTAATGAATA 
AGTTGTTAGG ACAGGACTGT GAATAGGGCT 
TGCCATCTCT GAGTAAATTA GGCTTCAAAC 
TTTTGCAAGG GAGGGCCTTG CATTTACCTT 
GGGAATAATA GATAATGCCA CTCTTTCATC 
ATCTGTAGGA TAAATTCCCA GAAATGCGCT 
TTAATAGATA TTGCAAATGA CTGGCTAAAG 
CTGTAATCCC AGCACTTTGG GAGGCCGCAG 
GACCAGCCTG GCCAACATGG TCTCTGCTAA 
TCTGGGCATG GTCGTGGGCA CCTGTAATCC 
CGCTTGAGCC TCAGAGGCAG AGGTTGCATT 
TGGGCAAAGA GTGAGACTCG GTCTCAAAAA 
TTGTTGCAGT CACATATGAC AGCAGTCCCC 
GAAGACAATA TTTTCCAGGG TGGAGTGGGG 
TCATCATCAG GCATTGGTTA GATTCTCATA 
GGAGTTGGCA ATAGGGT TTG TGCTTCTGTG 
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148501 AAAATCTAAT GCTGCTTATC TGACAGGAGG CGGAGCTTAG GCAGTGATGG TCACTCACCC 
14 8561 ACCGTCCCCT CCTGCTATGT GGCCTGGTTC CTAACAGGCC ATTGACTGAT ACTGCAGCAC 
14 8621 AAGGGTTGGG GACCCCTGAC ATAGGAGACT ATACATTTAT TTTAAGCTGT GGTATGCCAG 
14 8681 AATTGTAAAA TATAAAACAC AGTGGGGCTT TTAGGGCCAG AAATAATCAG TTCTTGCTCG 
148741 CTTCCAGAAG CATCCTTCAC AGGGGCTACC GTAACTCTTG CCAACCAAGT TCTCTTGGTT 
148 801 GGGAGGAAAA AATAGTGTTA TGCATTAAGA GAACTTCTTT CTGGAGTTAC TTGAAACCAT 
14 8861 TGGTATTCAG ATGATTAGGC AGATGTCACA AGGCAATAAG AATGTGACAG GTTCAC CAT T 
148921 CACTTTTTTT CCTGTAAAAG TGAAGTAGGG CTTTCTTGGG AACAAGCCCT TGGGAGGTGG 
148981 GGGGATGTGA ATGGTGAGGG GAGGGTAGAA ATGGTGGAGT AGGGTCAGGG GCAAGAAAGG 
149041 GACTTTCTGC TAAGAATTAA TCGGGTGTCC ATTTACTCTT AGCAGAAAAC TAGGATTAGA 
149101 TTCTGGATTG TACTCCTGAC TCCAAATTTT ACAAGTGGGG GTCTTGCATT TACCTTCCAG 
149161 GACCTCGGTC ATCTTAGCAG GAAAATAGCA ATAGCAGGTG ATGCCACCTT ACAGAGCGCT 
149221 TAGGAGACAG TGAGATGGTC TATATAGGAA GCTGTCTGGC CTGATACCTG ATGAATACAA 
1492 81 GGGGCCCAAT AAATACAGTG GCTGTTATGA ATAATAGATC TAAACTGCCT TTTTGGTACT 
149341 ACTGGGGACC TGC CAAGCAG GTGCATTTAG AGTGCCCAGT GCCTCTCCCT G CGAC ACATT 
14 9401 TGATGCCTCC CTACACCTGG ACCAGGCCTT GAGCGAGGAT TTCCACTGCA GAGGTCCTTC 
149461 CAGCTGGCGA ATTGTGTTGC AGATCAGGTT CAGAGAACTT CTGTTTTGCC TGTGTGGCAT 
149521 TCATTCATTC GTTTATTTGA AATAGAGATG GGATCTCACT GTGCTGCCCA GGCTAGTCTA 
149581 GAGCTCCTAA TTCAAGCAAT CCTCTTGGCT TGGCCTCCCA TAGTTCTTGG ATTACAGGTG 
149641 TGAACCACTG TATCCAGCCC TTTATGACAT TTAGAATATG AGCAATTTTT CTTTTTTCTT 
14 97 01 TTTTTTCTTT TTGAGATGGA GTCTCACTCT GTCACCCAGG CTAGAGTGCA GTGGCATGAT 
14 9761 CTTGGCTCAC TGCAACCTCT ACCTCCCAGG CTCAAGCGAT CTTCCCACCT CAGCCTCCCG 
14 9821 AGTAGCTGGG ACTACCGGCA TGTGCTGCCA TGCCTGGCTA ATTTTTGTAT TTTCTGTAGA 
14 9881 GATGGGGTTT CACCATGTTG CGCAGGCTGG TGTCAAACTC CTAAGCTCAA GCGAACTGCC 

14 9941 TGCCTTGGCC TCCCAGTGTT GGGATTACAG ACGTGAGCCA CAGTGCTGAA CCCTGCATGG 

15 0001 TATTTAGAAT ATAAGCAATA CTCTAACATC TGGTCTGGGT CACTCTGTAT TACTTACCTG 
15 0 061 ATCTCCAAAA ACATTTGGGT TTTTGTCTCT GGTCCAAAAT CTTTAGCCAA TGGCTTGGCA 
150121 GTAAAATCCT GAGGGAAGCT GTTGACCAGG TGAGGTGATG TGCAAATCCT ATACTCTCTG 
150181 GGCTCTGGGA TATTTAATTT AC TATTTATT TATTTATTTT CAAGACAGAG TTTTGCTCTT 
150241 GTCGCCCAGG CTGGAGTGCA GTGATGGGAT CTCAGCTCAC TGCACCCTCC ACCTCCTGGG 
150301 TTCAAGCGAT TCTCCTTCCT CAGCCTCCTG AGTAGCTGGT ATTACAGGCG CCCACCACCA 
150361 CACCTGGCTA TTTTTTGTAT TTTTAGTAGA GACGGGGTTT CACCATGTTG GCCAGGCTGG 
150421 TCTTGAACTA CTGAC CTCAG GTTATCCGCC TGCCTCGGCC TCCCGAAGTA CTGGGATTAC 
150481 AGGCATCAGC CACCATGCCC GGCCTAATTT ACTTTTTATT AATGCTGAAG CAGAGAGGGC 
15 0541 AAGATCTTTT GCCCCTGAGT TCTTCTGGGA AAAATGAAAC TGATGGTAAA ACAAACTAAA 

1506 01 GC AAC CTGAC ATTCTCAGTT GGTCCAGTTT CAGCCCTTTG ACTGGGAGTC ACAGACGGGT 
150661 CCCATAAAAT GGTAGAGCTG GGCCAGCCTA CCATTGATTT ATTTTCCCTA AATGAAAAAT 
150721 ACAAGGCCCA GAGAGGGCAA GTGACTTGTC CAGAGTCACC CAGCAGGTTT GGGGCAAAGC 

1507 81 TGAGACTCGT TACTTGACAT CCTAAGGTCT TCCAGAGGCT AATGATTAGC TTGTTTGTGC 
150841 TCAAAAAATG AAGCAGCCTG GGCGCGGTGG CTCATGCTTG TAATCCTAGC ACTTTGGGAG 
150901 GCTGAGGCAG GCAGATCGCT TGAGCTCAGG AGTTTGAGAC CAGCCTGGGC CACAAAGTGA 
150 961 GACCCCTGTC TCTACAAAAA AATGCAAGAA TTAAAAAATT AGCTGGGTGT TCTGGTGCGT 
151021 GCCTGTGATC CCAGCTACTT GGGAGGCTGA GGTGGGAGAA TGGCTTGAGC CTGGGAGGCA 
151081 GAGTTTGCAG AAAGCAGAGA TCGCGCCACT TCACTCTAGC CTGGGCAACA GAGCCAGACC 
151141 CTGTCTCAAA AAAAAAGAAT GAAGCAGTTG TTGGTCAGGA CAGGACTGTA AACAAGGCTG 
1512 01 ACACACTCAG ATGTGTCAAA CATCGCTAAT GCCAAAGGTG ACAGAGTCAT TTGTTTTCAT 
1512 61 CCAAACATTC GAGAAAGTTG GACGAGGTGA CTCACGCCTG TCATCCTAGA GCTTTGGGAA 
151321 GCCAAGGCAG GAGGATCATT TGAGATCAGG AGTTTGAGAC CAGCCTAGGC AAAATAGCAA 
151381 GACCCCCATC TCTACAAAAA ATAAGCCGGG CATAGTGGCC CACACCTGAG GTGGGAGGAT 
151441 CCCTTGAGCC CATGAGTTTG AGCCTGCAGT AAGCTATGAT TGCACCACTG CACTCCACCC 
151501 TGGGCATATA GTGAGACCCT TCCCCCAACC AAAAACATTG AGAGCAGCTC TTGATGAGTG 
151561 AACTGTACTT CGTGGTCAGC AGTTCTGGGT AGTAATTTCA GAGATGTCCT TTCAGCCCTT 
151621 GGAGCTGATG CAGGACCTTA AACATGAGCG ATGGTGGAGG AGGGAGGGTT GGGAAGGTGC 
151681 ATCAAGGTAG ATGAAGAGTG TCCCTGGGGT TGGGCCAACT GGCGGTCCGT CTCTGGTCCA 
151741 GTGTGTTCAC CTTGCCCCCG TCTGATCTTC TGCAGTTGGT ATTCCGAGTT GAGTTTGACT 
151801 AAGTGAGAGC TGCTCTCAGC TTTAACTGCC TTTCCCAAGA CAGCCCTTGT TTTTATTCTA 
151861 AAGCTGTGGT TCTCAACTGG AAGCAGTTTT GCCACCCCAG GGGACATCTA GCAGTGTCTG 
151921 GAGACATTTT TGATTGTCAT GAGTGGAGGA AGGGGTGCTA CTGGCATCAG GTGGGCAGAG 
151981 ACCAGGGATG CTGCGGAACA TCCCACAATG C AC GGAAGAG CTCCCCTCAC GACACAGAAT 
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152041 GACGCAGCCC AAGAGTCACA GTGCAGAGTT 
152101 ATCCCAGCAC TTTGGGGGGC CAAGGTGGGA 
152161 GCCTGGCCAA TATGGTGAAA CCTCATCTCT 
152221 TAGCGCATGC CTGTAGTCCC AGCTACTTGG 
152281 AGAAACGTGG AGGTTGCAGT GAGCTGAGAT 
152341 AGCGAGACTC TGTGTCAAAA AAAAAAAAAA 
152401 GTATTAGTAT TATTAATTTG TCAGGCTCAC 
152461 AGGAAAACAT TCTCAAAACA TTTGGCAAAT 
152521 GGCTCGTAGG AGCAAAACAG CTTTCAGGTT 
152581 TTGTGTGTTT TGTGTCTCAA TGAGGGAAAG 
152641 GAC CTTGACT GCGCCACTGT TTGAAGAGCT 
152701 GATATTTCAG GGAATTAGGG TTCCAAGGGG 
152761 GTCGAAGCTT CTCATCCCCC ATCTTGCTGT 
152821 AACTGTCAGA GACATCAAAG CCAGAGGGAA 
152881 ACAGAGACTG AATCTTTGCA CTTGATCTTG 
152941 GTCCCAGGCA ATGGAGGCGT GCGTGTCCTT 
153001 TTCTGACTTT TAAGGCAACT CATCCACATG 
153061 CACAGCGGCT CATGCCTGTA ATCCCAGCAC 
153121 AGGTCAGGAG TTCAAGACCT CAGCCTGGAC 
153181 TACAGGAAAA TAGCTGGGCA TGGTGGCAGG 
153241 GGGTGGGAGG ATCACTTGAG CCTGGGAGGC 
153301 TGCACTCCAG CCTGGGCGAC AGAGCAAGAC 
153361 GAAGTCCACT TTACTTGTCA TAGTGCTTAG 
153421 GGATGTAATT GGCTACCATC TGCACAAACT 
153481 TTAATGCTAC TGAACTGTGT TTAAGTGGCC 
153541 CAATTAAAGA TAAGAGGGAA GGAAAATGAA 
153601 ATGTGCAAAA AAGTGTGCAG CTTGATGAAT 
153661 GGTCTCACTC TGTCACCCAG GCTGCAGTGC 
153721 GACCTCCTGA GCTTAAGTGA TCCTCCCACC 
153781 GCACACTATC ACAACCGGTT AATTTTTGTA 
153841 GACCAGGCCG GTCTCAGCCT CCTGGGCTCA 
153901 GGTAATTAAA AAACATTTTT TCTTAGAGAT 
153961 AAACTCCTGG GCTCAAGTGG TCCTCCCATC 
154021 GTGAGCCATG TCACCTGGCC CAACAGTTTG 
154081 GCTGGCATTC AGATGAAGAT CTAGAGGTCA 
154141 TTGCAATCAT TGCACACCGG AGACTCATTC 
154201 TAATTCTGCC TGGTTTTGGA CCTCAGTTCA 
154261 ACCTTCTTTT GCTCAAGGAT GTGTTGTGAG 
154321 TTACTTCACC TGATTCGAGT CCTATTTTGG 
154381 GTGCCTTACA GGATTGATTC CTTTATGGGC 
154441 GGCAGAACCA AATAGACTCC AGAATGAGAC 
154501 TTTTTTTTGA GATGGAGGCT GACTCTGTCG 
154561 GCTTACGGCA ACCTCCACCT CCCGGGTTCA 
154621 GCTGGGGCTA CAGGCACGTA CCACCAATCC 
154681 AGAGACAGGG TTTCACAATG TTGGCCAAGC 
154741 TCCCACCTCA GCCTCCCAAA GTTCTGGGAT 
154801 TTGTGATCAA TCTTACTTCA TCTTCACACC 
154861 CTCTCTCTCT CTCTCCTTCT TTTTCTCTCT 
154921 TGCGACACCT TAAGAAACAA GAGATTATCA 
154981 TATTTTGCTG CCTGTTTAAA CTAACCTTGG 
155041 AAGCCACTTC TGTGACTATG GCTGTCCAGA 
155101 TAAATGCTAT TGGTTAGGAA TGAGCGAAGT 
155161 ATAGAAACAG AAGGTACTTG GTGTAGATCA 
155221 CACCCCAGAA AAAGAAGAAT TGTCTTGGGC 
155281 GCTGATGAGC TAAAAAAAAA AAAAAAAATC 
155341 TTTATGAATT TGTATCGGGC CACATTCAAA 
1554 01 TGCAGGTTAG ACAAGCTTGG TGTAGAGAGT 
155461 GCACCCGTTA GGTCCCCAGT ATTAATATAC 
155521 CACCCAGAAG CACGCATTCT GCAGTGGCAG 



TGTGGCCAGC TGCGGTGGCT CACGCCTGTA 
GGATTGCTTG AGGCCTGGAG TTCAAGACCA 
ACTAAAAATA CAAAAATTAG CCAGGCATGG 
GAGGCTGAGG CACGAGAATC ACTTGAACCC 
TGCGTCACTG CACTCCAGCC TGGGTAACAG 
AAAAAAAAGA CTTAGCAACT ATTATTACTA 
TGAATTTTCT CAAAAATTTG GCAAATTTTT 
CTGTGGCTAA ATGTTGTTTT GGGGACCCAA 
TCCGGATCTG CCAGAGACTC AAGTGTCCTG 
GGGAATATGT AGCACCTTCC AGATGGATTT 
TCTCAACCTC CGCAGCTCCA CCCCAGCCCA 
CATGCTATGG AAAACACCAT TCTAGCATGA 
CTTTTGACCA AAGCAGATTT TGCACGTCGT 
TCCAGCCTGC TCCAAGCTCT CCTTTTTTGT 
TTTGTGTTTT TAAGTCTGAG GTTAGACAGG 
TTATTTTTCT GTTGTAGCTT TTGCTATTTT 
GCAATTAGGA AGAGCCCACT TAGGGCTGGG 
TTTGGGAGAC CGAGGCAGGC AGATCACTTG 
AACATGGTGA AACCCCGTCT CTACAAAGAA 
TGCCTGTGGT CCCAACTATT TGGGAGGCTG 
GGAGGTTGCC GTGAGCTGAG GTCATGCCAC 
CCTGTCTCAG AAAAAAAAAA AAAAAAAAAA 
AACAAATGAA ACACTCTCCT AGCCCTCTTG 
CTTCATTATT GCACAAGAAT ATCAATATAC 
GAGGTGGTGA ATGTTAGCTG TATTTTACCA 
GTGTACTTTA CAACCAAAAA AGTACGCTTG 
TTTCAAGAGG ATATATTTTT TATAGATGGG 
AGTGGCATGA TCATGGCTCA CTGCATCCCC 
TCAGCCTCCT GAGTAGCTGG GACTGCAGGT 
TGTTTGCTAG AGACAAGGTT TCACCATGTT 
GGTTATCCTC CTACCTCAGT CTTCCACACA 
GGGTCTTGCT GTGTTGGCCA GGCTGGTCTC 
TTGGCTTCTC AAAGTGCTGG GATTACAGGC 
ATGAATTTTC AGAAAGTGAA CACTCATAGG 
ACCCTCACAA GCCCCCCTCA CGTTCTGTCC 
ATTCCTTATC TGAGTTCTAT CACCGTAGAT 
ATAGTCACAG AACCTGTGCT TTTTGTGACC 
ATGTCCTTTT TTGTGGTGTG GAGCTGTAGT 
GTGTTTGTAA TGTGTCAGGT ACTGTGC GAG 
ATCTGACAAG CCCACCCACC TTATGTGAAA 
CCAGGTTTGG GTCCCAGCTC TGACACTTCT 
CCAAGGCTGG AGTGTAGTGG TATGATGTCG 
AGTGATTCTC CTGACTCAGC CTCCCAAGTA 
TGGCTAATTT TTAATTTTTG TATTTTTAGT 
TGGTCTCAAA CTCCTGACCT CAAGTTATCC 
TATAGGCATG AGCCATCACA CTCGGCCTAC 
CTCCCATTTC TCTTACGCAT CCTCCAGTTT 
CTCTCTCTCA CACACACACA CACGATCTGC 
GGGAATGATT GAATATTTTG CCGCATTTCC 
TTATACTATT AAAAGAAGAC GCGTCGTATC 
AATAAACATA ATTAAAACAT CCAACAGTAG 
GGCTTAGAGT CACCGGAAGT GAGAAAGGGT 
GGGGTGTCCT ATCTTTTGGC TTCCCTGGGC 
CACACGTAAA ATACACTAGC ACTAATGATA 
GCGAAAAAAT ATCATACTGT TTTAAGAAAG 
GCCGTCCTGG GCCCCATGCA GCCTGTGGGC 
TTCATCTAAA CTTCATGGCA GCTCTGCAGG 
AGTAAATCTG AGTCTCAGAT CTACGTAAGT 
AGTCACGTTT GAATTAGCAT CTGATTGCAA 
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155581 AGTCTGGGTG TCTTTACATG ACTACAGGTT ATCTTACCTC TCAAGAGGAG GCAACCAATC 
155641 AAATGTTGCC AGCACCAATG AACTTGTACT TTATTTAGGC TCAGAAAGAT CTTTTAGGCT 
155701 AATGAAAATG CCCTATATTT ATGAAATGTT CTCGTTCTCT GTGGCTTTCT CTTTTTTGAG 
155761 ACAGGGTCTC ACCCTGACAC CCAGGCTGGA GTGCAGTGAT GTAATCATAG CTCACTGCAG 
155821 CCTCAAACTC CTGGGCTCAA GCAACCCTCC TGCCTCAGCC TCCTAGTAGC TGGGACTACA 
155881 AGCACGCATC ATCATGCCTG GCTGATATTT TTTTTAAGGG ATGGGGTCTT GCTATAATGC 
155941 CCAGTCTGGT CTCGAACTCC TGGGCTCAAG CAATCCTCCT GCCTTGGCCT CCCAAAATAT 
156001 GGGATTATAC ATGTGGGCTA CTGCCAGCCT CTTTTCTTTC AATTATTTTT TAATCTATGG 
156061 GTTCCCCTCC TTTTTGTTTG TATTTTATTT GTTAAAGAAA GAGAGTACTG GCCGAGCGTG 
156121 GTGGCTCACA CCTGTAATGT CAGCACTTTG AGAGGCCAAG GCCGGTAGAT CACCTGAGGT 
156181 CAGGAGTTTG AGACCAGCCT GGACAATATG GTGAAACCCC GTCTCTACTA AAAATACAAA 
156241 AATCAGCCAG GCGT GGTGGC ATGCACCTGT AATCCTAGCT CCTCGGGAGG CTGAGGCAGG 
1563 01 AGAATCACTT GAACCTAGGA GGTGGAGGTT GCAGTGAGCC AAGATCCCGC CATTGCACTC 
156361 TAGCTGGGCG ACAGAGCATA GTCTCTCACC TTTGGGAGTT TACTGCATTG TTTAGCATGC 
156421 TCTCCTGTGC CTTGCATTTT CCATAGACAG GCGTCAGATC TGGAGGCTTC ATCACCTTCA 
156481 TCCCCCATCT CCATCCCCTT TTCTTTTGAG CAAGAATATG TCATTAGTGG TAACGGCACT 
156541 TCCTGTAGTG GCCCATCTGC AGGCATGTAA TGTTTATAAT GTCTAGTCAG CTCTCTCTTT 
156601 TTGTGATGTT AGGGTTAATT AGTAGATTTA GGTGATGGCA GGCGGACCCA TCCCTTAAAA 
156661 ATTCCACAAG AGCTCTTCAT CTGATATAGT CAGTCTTGTG GTGGGGACCC TAGACCAGCA 
156721 TCATCATCAT CACCCGGAAG CTGGTTAGGA ATGCATATTC TTGGGCCCCA TCCCAGTCCT 
156781 ACTGACTCAG AAGCTAATGC ACCAGGAAAT GTGAGCCCCA TTGGCCTAAT GGTTTTAGCA 
156841 ATTACTGGTA GAACTTGCCA ACTTGCCAAG ACCCTTTCTT TCTTCCTTTC TTTCTTTTTT 
156901 TTTTTTTGAG ACGGAGTCTC ACTCTGTCGT CCAGGCTGGA GTGCAGCGGC GCATCTCCAC 
156961 TCACCCACTC ACTGCAAGCT CCGCCTCCCA GGTTCACACC ATTCTCCTGC CTCAGCCTCC 
157021 AGAGTAGCTG GGACTACACG CGGCCGCCAC CACGCCCGGC TAATTTTTTT TTTTTTTTTT 
157081 AGTAGAGACA GGGTTTTGCC GTGTTAGCC +13518 
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